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Introduction 

Environmental  estrogens  have  been  implicated  as  etiological  agents  for  breast  cancer. 
This  study  is  aimed  at  identifying  mechanistic  information  regarding  the  estrogenicity  and 
carcinogenicity  of  environmental  estrogens  through  structure-activity  relationship  (SAR)  and 
quantitative  SAR  (QSAR)  modeling.  To  facilitate  this  investigation,  the  computer-based 
expert-system  MULTICASE  is  used  in  addition  to  other  molecular  modeling  techniques. 
Learning  sets  have  been  derived  from  bioassays  designed  to  detect  estrogenic  chemicals. 
These  databases  include  an  in  vitro  estrogen  competitive  binding  assay,  an  in  vitro  MCF-7 
cell  proliferation  assay  and  a  whole  animal  uterine  weight  increase  assay  (currently  being 
derived).  The  SAR  and  QSAR  models  derived  from  these  databases  are  considered  in 
conjunction  with  preexisting  SAR  and  QSAR  models  for  rodent  carcinogenicity  as  well  as 
for  other  toxicological  phenomena  to  identify  features  that  are  common  to  estrogenicity  and 
carcinogenicity. 

Body 

The  past  year  has  been  very  exciting  because  I  was  awarded  my  Ph.D.  As  part  of  the 
rigors  of  writing  my  dissertation  and  preparing  for  the  defense,  I  did  not  have  the  necessary 
time  to  commit  to  the  Completion  of  this  research  project.  I  have  been  granted  a  one  year 
extension  by  the  USAMRMC.  It  must  be  strongly  noted  that  due  to  the  support  through 
this  project,  I  was  able  to  excel  in  my  academic  pursuits.  Moreover,  research 
conducted  to  date  as  part  of  this  project  have  laid  the  groundwork  for  further  research 
opportunities.  Additionally,  a  position  of  assistant  professor  was  offered  to  me  from  the 
department  from  which  I  graduated.  As  part  of  accepting  this  offer,  I  have  verified  that  I 
will  have  the  time  and  resources  for  this  project’s  successful  completion.  I  believe  that 
this  position  was  offered  to  me,  in  part,  due  to  the  University  of  Pittsburgh’s 
understanding  of  the  importance  of  breast  cancer  research  as  well  as  my  particular  area 
of  interest  in  computational  toxicology  and  its  successful  applications  to  the 
investigation  into  the  etiology  of  breast  disease. 

To  date,  SAR  models  have  been  created  and  published  describing  mouse  [1]  and  rat  [2] 
carcinogens.  These  models  will  serve  as  the  basis  for  further  comparisons  for  the 
estrogen  SAR  models. 

Previously  in  our  laboratory,  we  had  a  standard  method  for  assessing  the  structural  and 
mechanistic  similarity  between  various  SAR  models.  A  series  of  comparisons  between 
diverse  SAR  models  is  conducted.  CASE  biophores  derived  from  several  SAR  models 
are  used.  Basically,  structural  overlap  is  defined  as  a  biophore  from  one  SAR  model 
being  identical  to  a  biophore  in  another  SAR  model  or  being  embedded  in  a  biophore 
from  another  SAR  model.  For  example,  an  overlap  would  be  noted  in  either  of  the 
following  cases;  CI-CH2  from  model  1  is  identical  to  CI-CH2  from  model  2,  or  CH=N- 
from  model  1  is  imbedded  in  CH=N-C=CH-  from  model  2.  For  statistical  analyses, 
overlap  with  a2p-induced  nephropathy,  inhibitors  of  cytochrome  P450  2D  and  binding  to 
cytosolic  Ah  receptor  have  served  as  the  null  hypothesis  in  that  minimal  mechanistic 
similarity  was  expected  to  exist  between  these  SAR  models  and  carcinogenesis.  Thus 
SAR  models  that  contain  significantly  more  similar  features  than  expected  can  be 
considered  as  having  mechanistic  similarities.  These  relationships  can  cover  several 
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levels  of  biological  complexity  and  perhaps  ease  in  the  identification  of  underlying 
mechanisms.  For  example,  an  SAR  relationship  has  been  identified  between 
Salmonella  mutagenicity  and  carcinogenicity  [3].  Arguably,  it  is  easier  to  understand  the 
mechanistic  basis  of  the  induction  of  mutations  in  Salmonella  at  the  molecular  level  than 
the  molecular  basis  of  cancer  induction.  Although  based  upon  analyses  of  the 
mutagenic  spectrum,  one  might  hypothesize  that  in  carcinogenesis,  the  molecular  basis 
is  similar  (i.e.,  mutation  of  a  suppressor  gene  (e.g.  p53)). 

However,  we  devised  a  more  informative  method  to  investigate  mechanistic  phenomena 
[4].  A  copy  of  this  manuscript  is  included  at  the  end  of  this  report.  Basically,  instead  of 
simply  analyzing  substructure  overlap  between  various  toxicological  phenomena,  we 
sought  to  develop  a  method  richer  in  information  that  describes  these  phenomena. 

Thus,  we  used  the  full  capabilities  of  the  CASE/MULTICASE  system  to  develop  a  more 
information  intensive  approach  to  explore  mechanistic  similarities  between 
biological/toxicological  phenomena.  It  is  expected  that  this  new  approach  will  be  very 
successful  in  analyzing  the  relationship  between  estrogenicity  and  carcinogenicity.  This 
approach  has  been  successfully  used  in  exploring  the  relationships  between  inhibition 
of  gap  junctional  intracellular  communication  with  other  toxic  phenomena  [5]. 

The  remainder  of  this  report  will  give  details  for  each  task  described  in  the  statement  of 
work. 

Task  1:  Development  of  criterion  for  the  acceptance  and  inclusion  of  experimental 
data  in  databases.  To  obtain  this  goal,  the  expertise  of  Drs.  Billy  Day  and 
Marjorie  Romkes  will  be  utilized. 

Task  2:  Literature  Search  and  collection  of  data  to  develop  estrogen  receptor 
competitive  binding  assay  database. 

Tasks  1  and  2  proved  to  be  difficult,  time  consuming  and  very  rewarding.  Over  150 
published  articles  were  collected  that  contained  data  pertaining  to  the  ability  of  various 
chemicals  to  bind  to  the  estrogen  receptor  (ER).  The  quantitative  measure  here  is 
expressed  as  relative  binding  to  affinity  (RBA)  to  the  ER  relative  to  estradiol.  Some 
reports  covered  one  chemical  while  many  included  a  large  series  of  chemicals.  Overall, 
ER  RBAs  were  gather  from  approximately  1400  chemical  experiments.  Portions  of 
these  have  been  tested  multiple  times  by  different  researchers.  These  data  will  be  used 
to  assess  the  overall  consistency  of  the  method.  At  first  pass,  the  data  seem  to  be 
consistent. 

Originally,  one  learning  set  and  SAR  model  was  anticipated  for  ER  RBA  data. 
However,  researchers  have  used  in  excess  of  seven  different  test  species/systems  to 
measure  RBA.  These  include  using  ERs  from  mice,  rats,  lambs/sheep,  calf,  human 
uterus,  MCF-7  cells,  and  clonal  assays.  It  was  concluded  that  multiple  models  would  be 
made  for  the  endpoints  with  the  most  data.  These  are  calf  (n=370),  rat  (n=354)  and 
mouse  (n=168).  An  additional  problem  encountered  was  that  different  researchers  used 
different  temperatures  to  measure  RBA  (e.g.,  from  0C  to  37C).  Where  data  existed  for 
one  chemical  being  tested  at  multiple  temperatures,  there  were  often  large  differences 
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observed  between  cold  and  warm  conditions.  In  order  to  address  the  potency  of  these 
chemicals  during  QSAR  analysis,  we  derived  a  regression  equation  that  is  capable  of 
adjusting  for  the  temperature  variation.  All  models  are  adjusted  to  OC  for  consistency. 

A  small  amount  of  data  has  been  gathered  for  human  ER  RBA.  Unfortunately,  there  is 
not  enough  to  derive  an  SAR  model.  However,  these  data  will  be  able  to  serve  as 
outside  validation  sets  to  see  how  well  the  derived  SAR  models  can  assess  chemical 
activity  in  human  systems. 

Task  3:  Literature  Search  and  collection  of  data  to  develop  estrogen-dependent 
growth  (e.g.,  MCF-7  cell  line)  database. 

To  date,  two  learning  sets  and  SAR  models  have  been  derived  for  MCF-7  estrogen- 
dependent  growth.  The  data  are  all  taken  from  the  publications  of  Dr.  Anna  Soto.  This 
assures  there  are  no  interlaboratory  differences.  The  first  model  is  for  relative 
proliferative  potency  (RPP)  (n=1 22),  which  measures  the  amount  of  a  test  chemical 
needed  to  maximally  stimulate  MCF-7  growth.  The  second  model  is  relative 
proliferative  effect  (RPE)  (n=122),  which  measures  how  much  cell  growth  a  chemical 
can  induce  compared  to  estradiol. 

Task  4:  Literature  Search  and  collection  of  data  to  develop  mouse  uterine  weight 

gain  database. 

Approximately  20  papers  have  been  collected  with  data  on  approximately  100 
chemicals  for  their  ability  to  stimulate  uterine  weight  gain.  Although  the  original  task  is 
for  mouse  only,  this  is  being  broadened  to  include  rat  data  as  well.  Moreover,  this 
assay  is  also  used  to  measure  antiestrogenicity  by  measuring  if  a  chemical  can  inhibit 
the  stimulator  effect  of  estradiol.  If  sufficient  data  are  available,  an  SAR  model  will  also 
be  created  for  antiestrogenicity.  It  is  expected  that  these  models  will  be  derived  within 
two  months. 

Task  5:  Creation  of  MUL  TICASE  databases  associated  with  tasks  2-4. 

SAR  models  have  been  created  for  RBA  data  (Task  2)  and  MCF-7  data  (Task  3).  The 
SAR  model  for  uterine  weight  gain  is  in  development. 

Task  6:  Validate  the  predictivity  of  each  database  in  conjunction  with  MULTICASE. 
Validation,  as  used  here,  depicts  how  well  the  derived  models  are  able  to  predict  the 
activity  (i.e.,  estrogen icity)  of  chemicals  not  contained  in  the  model.  These  models  can 
therefore  be  used  to  assess  the  activity  of  environmental  chemicals,  including 
phytoestrogens.  Of  the  models  derived  (Table  1),  it  can  be  seen  that  they  are  all  over 
70%  predictive.  These  are  very  acceptable  results  and  on  the  order  achieved  for  other 
toxicological  SAR  models.  The  remaining  models  should  take  less  than  one  month  to 
validate. 
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Table  1.  Validation  results  for  various  estrogenicity  SAR  models. 


Model 

Sensitivity 

Specificity 

Concordance 

Calf  ER  RBA 

0.73 

0.71 

0.72 

Rat  ER  RBA 

N/a 

N/a 

N/a 

Mouse  ER  RBA 

N/a 

N/a 

N/a 

MCF-7  RPP 

0.72 

0.81 

0.77 

MCF-7  RPE 

0.80 

0.72 

0.76 

Uterine  weight 

N/a 

N/a 

N/a 

Task  7:  MUL  TICASE  structural  analyses  of  each  database. 

The  databases  listed  in  Table  1  have  undergone  initial  structural  analysis.  The  remainder 
should  be  done  in  less  than  one  month. 

Task  8:  Identification  of  congeneric  sets  of  estrogens  that  may  act  through  similar 
mechanisms.  Investigate  various  attributes  of  chemicals  within  each  set 
to  determine  if  various  structural  or  physical-chemical  properties  modulate 
the  activity  of  these  chemicals  within  each  database. 

To  be  completed. 

Task  9:  Identify  structural  similarities  between  the  three  estrogenicity  databases  to 
determine  whether  the  databases  are,  in  fact,  measuring  the  same 
phenomena  (i.e.,  estrogenicity). 

To  be  completed.  This  task  obviously  has  been  expanded  to  include  the  additional  SAR 
models  discussed  above.  Moreover,  specific  similarities  will  be  assessed  for  the  individual 
SAR  model  included  under  ER  RBA,  MCF-7,  and  uterine  weight  increase. 

Task  10:  Identify  structural  similarities  between  the  estrogenicity  database  and 
extend  MULTICASE  database  for  carcinogenicity,  toxicity  and  genotoxicity 
to  identify  possible  similarities  in  structural  determinants  and  mechanisms 
of  action. 

To  be  completed.  This  task  has  been  expanded  to  include  the  new  method  for  exploring 
mechanistic  relatedness  between  various  toxicological  endpoints  described  above. 

To  conclude,  no  insurmountable  obstacles  exist  and  the  completion  of  the  work  required  for 
this  project  should  be  accomplished  by  the  end  of  summer.  Several  papers  describing  the 
results  of  this  work  will  be  submitted  for  review  shortly  thereafter.  It  is  anticipated  that  this 
entire  project  will  be  completed  by  December  1 999. 
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